CP 




Page 1 of 7 





ENTERED 



RAW SEQUENCE LISTING DATE : 01/09/2003 

PATENT APPLICATION: US/09/308,207 TIME: 11:14:33 

Input Set : N:\vernette\US09308207.raw 
Output Set: N:\CRF4\01092003\l308207.raw 

SEQUENCE LISTING 

1 (1) GENERAL INFORMATION: 

2 (i) APPLICANT: MARIA DIAZ-TORRES ET AL . 

3 (ii) TITLE OF INVENTION: METHOD FOR THE RECOMBINANT 

4 PRODUCTION OF 1,3 PROPANEDIOL 

5 (iii) NUMBER OF SEQUENCES: 68 

6 (iv) CORRESPONDENCE ADDRESS: 

7 (A) ADDRESSEE: Genencor International, Inc. 

8 (B) STREET: 4 Cambridge Place 

9 1870 South Winton road 

10 (C) CITY: Rochester 

11 (D) STATE: NY 

12 (E) COUNTRY: U.S. A 

13 (F) ZIP: 14618 

14 (v) COMPUTER READABLE FORM: 

15 (A) MEDIUM TYPE: Diskette 

16 {B) COMPUTER: IBM Compatible 

17 <C) OPERATING SYSTEM: Windows 

18 (D) SOFTWARE: FastSEQ for Windows Version 2.0b 

19 (vi) CURRENT APPLICATION DATA: 
C — > 20 (A) APPLICATION NUMBER: US/09/308 , 207 
C — > 21 (B) FILING DATE: 13-May-1999 

22 (C) CLASSIFICATION: 

23 (vii) PRIOR APPLICATION DATA: 

24 (A) APPLICATION NUMBER: 60/030,601 

25 <B) FILING DATE: 13-NOV-1996 
2 6 (viii) ATTORNEY /AGENT INFORMATION: 

27 (A) NAME: Glaister, Debra 

28 (B) REGISTRATION NUMBER: 33,888 

29 (C) REFERENCE/DOCKET NUMBER: GC 369-2 

30 (ix) TELECOMMUNICATION INFORMATION: 

31 (A) TELEPHONE: 650-864-7620 

32 (B) TELEFAX: 650-845-6504 

33 (C) TELEX: 

34 (2) INFORMATION FOR SEQ ID NO: 1: 

35 (i) SEQUENCE CHARACTERISTICS: 

36 (A) LENGTH: 1668 base pairs 

37 (B) TYPE: nucleic acid 

38 (C) STRANDEDNESS: single 

39 (D) TOPOLOGY: linear 
4 0 (ii) MOLECULE TYPE: DNA (genomic) 

41 (iii) HYPOTHETICAL: NO 

42 (iv) ANTI-SENSE: NO 
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77 
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79 
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81 

82 

83 

84 

85 

86 

87 

88 

89 

90 

91 

92 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: DHABI 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

ATGAAAAGAT CAAAACGATT TGCAGTACTG GCCCAGCGCC CCGTCAATCA GGACGGGCTG 60 

ATTGGCGAGT GGCCTGAAGA GGGGCTGATC GCCATGGACA GCCCCTTTGA CCCGGTCTCT 120 

TCAGTAAAAG TGGACAACGG TCTGATCGTC GAACTGGACG GCAAACGCCG GGACCAGTTT 180 

GACATGATCG ACCGATTTAT CGCCGATTAC GCGATCAACG TTGAGCGCAC AGAGCAGGCA 24 0 

ATGCGCCTGG AGGCGGTGGA AATAGCCCGT ATGCTGGTGG ATATTCACGT CAGCCGGGAG 300 

GAGATCATTG CCATCACTAC CGCCATCACG CCGGCCAAAG CGGTCGAGGT GATGGCGCAG 360 

ATGAACGTGG TG GAG AT GAT GATGGCGCTG CAGAAGATGC GTGCCCGCCG GACCCCCTCC 4 20 

AACCAGTGCC ACGTCACCAA TCTCAAAGAT AATCCGGTGC AGATTGCCGC TGACGCCGCC 4 80 

GAGGCCGGGA TCCGCGGCTT CTCAGAACAG GAGACCACGG TCGGTATCGC GCGCTACGCG 54 0 

CCGTTTAACG CCCTGGCGCT GTTGGTCGGT TCGCAGTGCG GCCGCCCCGG CGTGTTGACG 600 

CAGTGCTCGG TGGAAGAGGC CACCGAGCTG GAGCTGGGCA TGCGTGGCTT AACCAGCTAC 660 

GCCGAGACGG TGTCGGTCTA CGGCACCGAA GCGGTATTTA CCGACGGCGA TGATACGCCG 720 

TGGTCAAAGG CGTTCCTCGC CTCGGCCTAC GCCTCCCGCG GGTTGAAAAT GCGCTACACC 780 

TCCGGCACCG GATCCGAAGC GCTGATGGGC TATTCGGAGA GCAAGTCGAT GCTCTACCTC 84 0 

GAATCGCGCT GCATCTTCAT TACTAAAGGC GCCGGGGTTC AGGGACTGCA AAACGGCGCG 900 

GTGAGCTGTA TCGGCATGAC CGGCGCTGTG CCGTCGGGCA TTCGGGCGGT GCTGGCGGAA 960 

AACCTGATCG CCTCTATGCT CGACCTCGAA GTGGCGTCCG CCAACGACCA GACTTTCTCC 1020 

CACTCGGATA TTCGCCGCAC CGCGCGCACC CTGATGCAGA TGCTGCCGGG CACCGACTTT 1080 

ATTTTCTCCG GCTACAGCGC GGTGCCGAAC TACGACAACA TGTTCGCCGG CTCGAACTTC 114 0 

GATGCGGAAG ATTTTGATGA TTACAACATC CTGCAGCGTG ACCTGATGGT TGACGGCGGC 1200 

CTGCGTCCGG TGACCGAGGC GGAAACCATT GCCATTCGCC AGAAAGCGGC GCGGGCGATC 12 60 

CAGGCGGTTT TCCGCGAGCT GGGGCTGCCG CCAATCGCCG AC GAG GAG GT GGAGGCCGCC 1320 

ACCTACGCGC ACGGCAGCAA CGAGATGCCG CCGCGTAACG TGGTGGAGGA TCTGAGTGCG 1380 

GTGGAAGAGA TGATGAAGCG CAACATCACC GGCCTCGATA TTGTCGGCGC GCTGAGCCGC 14 40 

AGCGGCTTTG AGGATATCGC CAGCAATATT CTCAATATGC TGCGCCAGCG GGTCACCGGC 1500 

GATTACCTGC AGACCTCGGC CATTCTCGAT CGGCAGTTCG AGGTGGTGAG TGCGGTCAAC 1560 

GACATCAATG ACTATCAGGG GCCGGGCACC GGCTATCGCA TCTCTGCCGA ACGCTGGGCG 1620 

GAGATCAAAA ATATTCCGGG CGTGGTTCAG CCCGACACCA TTGAATAA 1668 
INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 585 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: DHAB2 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GTGCAACAGA CAACCCAAAT TCAGCCCTCT TTTACCCTGA AAACCCGCGA GGGCGGGGTA 60 

GCTTCTGCCG ATGAACGCGC CGATGAAGTG GTGATCGGCG TCGGCCCTGC CTTCGATAAA 120 

CACCAGCATC ACACTCTGAT CGATATGCCC CATGGCGCGA TCCTCAAAGA GCTGATTGCC 180 

GGGGTGGAAG AAGAGGGGCT TCACGCCCGG GTGGTGCGCA TTCTGCGCAC GTCCGACGTC 240 

TCCTTTATGG CCTGGGATGC GGCCAACCTG AGCGGCTCGG ' GGATCGGCAT CGGTATCCAG 300 

TCGAAGGGGA CCACGGTCAT CCATCAGCGC GATCTGCTGC CGCTCAGCAA CCTGGAGCTG 360 

TTCTCCCAGG CGCCGCTGCT GACGCTGGAG ACCTACCGGC AGATTGGCAA AAACGCTGCG 4 20 

CGCTATGCGC GCAAAGAGTC ACCTTCGCCG GTGCCGGTGG TGAACGATCA GATGGTGCGG 480 
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93 CCGAAATTTA TGGCCAAAGC CGCGCTATTT CATATCAAAG AGACCAAACA TGTGGTGCAG 54 0 

94 GACGCCGAGC CCGTCACCCT GCACATCGAC TTAGTAAGGG AGTGA 585 

96 (2) INFORMATION FOR SEQ ID NO: 3: 

97 (i) SEQUENCE CHARACTERISTICS: 

98 (A) LENGTH: 426 base pairs 

99 (B) TYPE: nucleic acid 

100 (C) STRANDEDNESS : single 

101 (D) TOPOLOGY: linear 

102 (ii) MOLECULE TYPE: DNA (genomic) 

103 (vi) ORIGINAL SOURCE: 

104 (A) ORGANISM: DHAB3 

105 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

106 ATGAGCGAGA AAACCATGCG CGTGCAGGAT TATCCGTTAG CCACCCGCTG CCCGGAGCAT 60 

107 ATCCTGACGC CTACCGGCAA ACCATTGACC GATATTACCC TCGAGAAGGT GCTCTCTGGC 120 

108 GAGGTGGGCC CGCAGGATGT GCGGATCTCC CGCCAGACCC TTGAGTACCA GGCGCAGATT 180 

109 GCCGAGCAGA TGCAGCGCCA TGCGGTGGCG CGCAATTTCC GCCGCGCGGC GGAGCTTATC 24 0 

110 GCCATTCCTG ACGAGCGCAT TCTGGCTATC TATAACGCGC TGCGCCCGTT CCGCTCCTCG 300 

111 CAGGCGGAGC TGCTGGCGAT CGCCGACGAG CTGGAGCACA CCTGGCATGC GACAGTGAAT 360 

112 GCCGCCTTTG TCCGGGAGTC GGCGGAAGTG TATCAGCAGC GGCATAAGCT GCGTAAAGGA 4 20 

113 AGCTAA 426 

115 (2) INFORMATION FOR SEQ ID NO: 4: 

116 (i) SEQUENCE CHARACTERISTICS: 

117 (A) LENGTH: 1164 base pairs 

118 (B) TYPE: nucleic acid 

119 (C) STRANDEDNESS: single 

120 (D) TOPOLOGY: linear 

121 (ii) MOLECULE TYPE: DNA (genomic) 

122 (vi) ORIGINAL SOURCE: 

123 (A) ORGANISM: DHAT 

124 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

125 ATGAGCTATC GTATGTTTGA TTATCTGGTG CCAAACGTTA ACTTTTTTGG CCCCAACGCC 60 

126 ATTTCCGTAG TCGGCGAACG CTGCCAGCTG CTGGGGGGGA AAAAAGCCCT GCTGGTCACC 120 

127 GACAAAGGCC TGCGGGCAAT TAAAGATGGC GCGGTGGACA AAACCCTGCA TTATCTGCGG 180 

128 GAGGCCGGGA TCGAGGTGGC GATCTTTGAC GGCGTCGAGC CGAACCCGAA AGACACCAAC 24 0 

129 GTGCGCGACG GCCTCGCCGT GTTTCGCCGC GAACAGTGCG ACATCATCGT CACCGTGGGC 300 

130 GGCGGCAGCC CGCACGATTG CGGCAAAGGC ATCGGCATCG CCGCCACCCA TGAGGGCGAT 360 

131 CTGTACCAGT ATGCCGGAAT CGAGACCCTG ACCAACCCGC TGCCGCCTAT CGTCGCGGTC 420 

132 AATACCACCG CCGGCACCGC CAGCGAGGTC ACCCGCCACT GCGTCCTGAC CAACACCGAA 480 

133 ACCAAAGTGA AGTTTGTGAT CGTCAGCTGG CGCAAACTGC CGTCGGTCTC TATCAACGAT 54 0 

134 CCACTGCTGA TGATCGGTAA ACCGGCCGCC CTGACCGCGG CGACCGGGAT GGATGCCCTG 600 

135 ACCCACGCCG TAGAGGCCTA TATCTCCAAA GACGCTAACC CGGTGACGGA CGCCGCCGCC 660 

136 ATGCAGGCGA TCCGCCTCAT CGCCCGCAAC CTGCGCCAGG CCGTGGCCCT CGGCAGCAAT 720 

137 CTGCAGGCGC GGGAAAACAT GGCCTATGCT TCTCTGCTGG CCGGGATGGC TTTCAATAAC 780 

138 GCCAACCTCG GCTACGTGCA CGCCATGGCG CACCAGCTGG GCGGCCTGTA CGACATGCCG 84 0 

139 CACGGCGTGG CCAACGCTGT CCTGCTGCCG CATGTGGCGC GCTACAACCT GATCGCCAAC 900 
14 0 CCGGAGAAAT TCGCCGATAT CGCTGAACTG ATGGGCGAAA ATATCACCGG ACTGTCCACT 960 
141 CTCGACGCGG CGGAAAAAGC CATCGCCGCT ATCACGCGTC TGTCGATGGA TATCGGTATT 1020 
14 2 CCGCAGCATC TGCGCGATCT GGGGGTAAAA GAGGCCGACT TCCCCTACAT GGCGGAGATG 1080 
14 3 GCTCTAAAAG ACGGCAATGC GTTCTCGAAC CCGCGTAAAG GCAACGAGCA GGAGATTGCC 114 0 
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14 4 GCGATTTTCC GCCAGGCATT CTGA 1164 

146 (2) INFORMATION FOR SEQ ID NO: 5: 

147 (i) SEQUENCE CHARACTERISTICS: 

148 (A) LENGTH: 1380 base pairs 

149 (B) TYPE: nucleic acid 

150 (C) STRANDEDNESS: single 

151 (D) TOPOLOGY: linear 

152 (ii) MOLECULE TYPE: DNA (genomic) 

153 (vi) ORIGINAL SOURCE: 

154 (A) ORGANISM: GPD1 

155 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

156 CTTTAATTTT CTTTTATCTT ACTCTCCTAC ATAAGACATC AAGAAACAAT TGTATATTGT 60 

157 ACACCCCCCC CCTCCACAAA CACAAATATT GATAATATAA AGATGTCTGC TGCTGCTGAT 120 

158 AGATTAAACT TAACTTCCGG CCACTTGAAT GCTGGTAGAA AGAGAAGTTC CTCTTCTGTT 180 

159 TCTTTGAAGG CTGCCGAAAA GCCTTTCAAG GTTACTGTGA TTGGATCTGG TAACTGGGGT 24 0 

160 ACT AC TAT TG CCAAGGTGGT TGCCGAAAAT TGTAAGGGAT ACCCAGAAGT TTTCGCTCCA 300 

161 ATAGTACAAA TGTGGGTGTT CGAAGAAGAG ATCAATGGTG AAAAATTGAC TGAAATCATA 360 

162 AATACTAGAC ATCAAAACGT GAAATACTTG CCTGGCATCA CTCTACCCGA CAATTTGGTT 4 20 

163 GCTAATCCAG ACTTGATTGA TTCAGTCAAG GATGTCGACA TCATCGTTTT CAACATTCCA 4 80 

164 CATCAATTTT TGCCCCGTAT CTGTAGCCAA TTGAAAGGTC ATGTTGATTC ACACGTCAGA 54 0 

165 GCTATCTCCT GTCTAAAGGG TTTTGAAGTT GGTGCTAAAG GTGTCCAATT GCTATCCTCT 600 

166 TACATCACTG AGGAACTAGG TATTCAATGT GGTGCTCTAT CTGGTGCTAA CATTGCCACC 660 

167 GAAGTCGCTC AAGAACACTG GTCTGAAACA ACAGTTGCTT ACCACATTCC AAAGGATTTC 720 

168 AGAGGCGAGG GCAAGGACGT CGACCATAAG GTTCTAAAGG CCTTGTTCCA CAGACCTTAC 780 

169 TTCCACGTTA GTGTCATCGA AGATGTTGCT GGTATCTCCA TCTGTGGTGC TTTGAAGAAC 84 0 

170 GTTGTTGCCT TAGGTTGTGG TTTCGTCGAA GGTCTAGGCT GGGGTAACAA CGCTTCTGCT 900 

171 GCCATCCAAA GAGTCGGTTT GGGTGAGATC ATCAGATTCG GTCAAATGTT TTTCCCAGAA 960 

172 TCTAGAGAAG AAACATACTA CCAAGAGTCT GCTGGTGTTG CTGATTTGAT CACCACCTGC 1020 

173 GCTGGTGGTA GAAACGTCAA GGTTGCTAGG CTAATGGCTA CTTCTGGTAA GGACGCCTGG 1080 

174 GAATGTGAAA AGGAGTTGTT GAATGGCCAA TCCGCTCAAG GTTTAATTAC CTGCAAAGAA 114 0 

175 GTTCACGAAT GGTTGGAAAC ATGTGGCTCT GTCGAAGACT TCCCATTATT * TGAAGCCGTA 1200 
17 6 TACCAAATCG TTTACAACAA CTACCCAATG AAGAACCTGC CGGACATGAT TGAAGAATTA 12 60 
177 GATCTACATG AAGATTAGAT TTATTGGAGA AAGATAACAT ATCATACTTC CCCCACTTTT 1320 

17 8 TTCGAGGCTC TTCTATATCA TATTCATAAA TTAGCATTAT GTCATTTCTC ATAACTACTT 1380 

180 (2) INFORMATION FOR SEQ ID NO: 6: 

181 (i) SEQUENCE CHARACTERISTICS: 

182 (A) LENGTH: 2946 base pairs 
•183 (B) TYPE: nucleic acid 

184 (C) STRANDEDNESS: single 

185 (D) TOPOLOGY: linear 

18 6 (ii) MOLECULE TYPE: DNA (genomic) 

187 (vi) ORIGINAL SOURCE: 

188 (A) ORGANISM: G-PD2 

189 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

190 GAATTCGAGC CTGAAGTGCT GATTACCTTC AGGTAGACTT CATCTTGACC CATCAACCCC 60 

191 AGCGTCAATC CTGCAAATAC ACCACCCAGC AGCACTAGGA T GAT AG AG AT AATATAGTAC 120 

192 GTGGTAACGC TTGCCTCATC ACCTACGCTA TGGCCGGAAT CGGCAACATC CCTAGAATTG 180 

193 AGTACGTGTG ATCCGGATAA CAACGGCAGT GAATATATCT TCGGTATCGT AAAGATGTGA 24 0 

194 TATAAGATGA TGTATACCCA AT GAG GAG CG CCTGATCGTG ACCTAGACCT TAGTGGCAAA 300 
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195 


AACGACATAT 


CTATTATAGT 


GGGGAGAGTT 


TCGTGCAAAT 


*¥\ TV VI TV ✓"I T\ y"*1 TV 

AACAGACGCA 


TV /"" * TV TV y^t m TV 

GCAGCAAGTA 


360 


196 


ACTGTGACGA 


TATCAACTCT 


TTTTTTATTA 


TGTAATAAGC 


AAACAAGCAC 


GAATGGGGAA 


420 


197 


AGCCTATGTG 


CAATCACCAA 


GGTCGTCCCT 


TTTTTCCCAT 


TTGCTAATTT 


AGAATTTAAA 


480 


198 


GAAACCAAAA 


GAATGAAGAA 


AGAAAACAAA 


TACTAGCCCT 


AACCCTGACT 


TCGTTTCTAT 


540 


199 


GATAATACCC 


TGCTTTAATG 


AACGGTATGC 


CCTAGGGTAT 


ATCTCACTCT 


GTACGTTACA 


600 


200 


AACTCCGGTT 


ATTTTATCGG 


AACATCCGAG 


CACCCGCGCC 


TTCCTCAACC 


CAGGCACCGC 


660 


201 


CCCAGGTAAC 


CGTGCGCGAT 


GAGCTAATCC 


T GAG C CATC A 


CCCACCCCAC 


CCGTTGATGA 


720 


202 


CAGCAATTCG 


GGAGGGCGAA 


AATAAAACTG 


GAGCAAGGAA 


TT AC CATC AC 


CGTCACCATC 


780 


203 


ACCATCATAT 


CGCCTTAGCC 


TCTAGCCATA 


GCCATCATGC 


AAGCGTGTAT 


CTTCTAAGAT 


840 


204 


TCAGTCATCA 


TCATTACCGA 


GTTTGTTTTC 


CTTCACATGA 


TGAAGAAGGT 


TTGAGTATGC 


900 


205 


TCGAAACAAT 


AAGACGACGA 


TGGCTCTGCC 


ATTGGTTATA 


TTACGCTTTT 


GCGGCGAGGT 


960 


206 


GCCGATGGGT 


TGCTGAGGGG 


AAGAGTGTTT 


AGCTTACGGA 


CCTATTGCCA 


TTGTTATTCC 


1020 


207 


GATTAATCTA 


TTGTTCAGCA 


GCTCTTCTCT 


ACCCTGTCAT 


TCTAGTATTT 


T T T T T T T T T T 


1080 


208 


TTTTTGGTTT 


TACTTTTTTT 


TCTTCTTGCC 


TTTTTTTCTT 


GTTACTTTTT 


TTCTAGTTTT 


1140 


209 


TTTTCCTTCC 


ACTAAGCTTT 


TTCCTTGATT 


TATCCTTGGG 


TTCTTCTTTC 


TACTCCTTTA 


1200 


210 


GATTTTTTTT 


T TAT AT ATT A 


ATTTTTAAGT 


TTATGTATTT 


TGGTAGATTC 


AATTCTCTTT 


1260 


211 


CCCTTTCCTT 


TTCCTTCGCT 


CCCCTTCCTT 


ATCAATGCTT 


GCTGTCAGAA 


GATTAACAAG 


1320 


212 


ATACACATTC 


CTTAAGCGAA 


CGCATCCGGT 


GTTATATACT 


CGTCGTGCAT 


ATAAAATTTT 


1380 


213 


GCCTTCAAGA 


TCTACTTTCC 


TAAGAAGATC 


ATTATTACAA 


ACACAACTGC 


ACTCAAAGAT 


1440 


214 


GACTGCTCAT 


ACTAATATCA 


AACAGCACAA 


ACACTGTCAT 


GAGGACCATC 


CTATCAGAAG 


1500 


215 


ATCGGACTCT 


GCCGTGTCAA 


TTGTACATTT 


GAAACGTGCG 


CCCTTCAAGG 


TTACAGTGAT 


1560 


216 


TGGTTCTGGT 


AACTGGGGGA 


CCACCATCGC 


CAAAGTCATT 


GCGGAAAACA 


CAGAATTGCA 


1620 


217 


TTCCCATATC 


TTCGAGCCAG 


AGGTGAGAAT 


GTGGGTTTTT 


GATGAAAAGA 


TCGGCGACGA 


1680 


218 


AAATCTGACG 


GATATCATAA 


ATACAAGACA 


CCAGAACGTT 


AAATATCTAC 


CCAATATTGA 


1740 


219 


CCTGCCCCAT 


AATCTAGTGG 


CCGATCCTGA 


TCTTTTACAC 


TCCATCAAGG 


GTGCTGACAT 


1800 


220 


CCTTGTTTTC 


AACATCCCTC 


ATCAATTTTT 


ACCAAACATA 


GTCAAACAAT 


TGCAAGGCCA 


1860 


221 


CGTGGCCCCT 


CATGTAAGGG 


CCATCTCGTG 


TCTAAAAGGG 


TTCGAGTTGG 


GCTCCAAGGG 


1920 


222 


TGTGCAATTG 


CTATCCTCCT 


ATGTTACTGA 


TGAGTTAGGA 


ATCCAATGTG 


GCGCACTATC 


1980 


223 


TGGTGCAAAC 


TTGGCACCGG 


AAGTGGCCAA 


GGAGCATTGG 


TCCGAAACCA 


CCGTGGCTTA 


2040 


224 


CCAACTACCA 


AAGGATTATC 


AAGGTGATGG 


CAAGGATGTA 


GATCATAAGA 


TTTTGAAATT 


2100 


225 


GCTGTTCCAC 


AGACCTTACT 


TCCACGTCAA 


TGTCATCGAT 


GATGTTGCTG 


GTATATCCAT 


2160 


226 


TGCCGGTGCC 


TTGAAGAACG 


TCGTGGCACT 


TGCATGTGGT 


TTCGTAGAAG 


GTATGGGATG 


2220 


227 


GGGTAACAAT 


GCCTCCGCAG 


CCATTCAAAG 


GCTGGGTTTA 


GGTGAAATTA 


TCAAGTTCGG 


2280 


,228 


TAGAATGTTT 


TTCCCAGAAT 


CCAAAGTCGA 


GACCTACTAT 


CAAGAATCCG 


CTGGTGTTGC 


2340 


229 


AGATCTGATC 


ACCACCTGCT 


CAGGCGGTAG 


AAACGTCAAG 


GTTGCCACAT 


ACATGGCCAA 


2400 


230 


GACCGGTAAG 


TCAGCCTTGG 


AAGCAGAAAA 


GGAATTGCTT 


AACGGTCAAT 


CCGCCCAAGG 


2460 


231 


GATAATCACA 


TGCAGAGAAG 


TTCACGAGTG 


GCTACAAACA 


TGTGAGTTGA 


CCCAAGAATT 


2520 


232 


CCCAATTATT 


CGAGGCAGTC 


TACCAGATAG 


TCTACAACAA 


CGTCCGCATG 


GAAGACCTAC 


2580 


233 


CGGAGATGAT 


TGAAGAGCTA 


GACATCGATG 


ACGAATAGAC 


ACTCTCCCCC 


CCCCTCCCCC 


2640 


234 


TCTGATCTTT 


CCTGTTGCCT 


CTTTTTCCCC 


CAACCAATTT 


ATCATTATAC 


ACAAGTTCTA 


2700 


235 


CAACTACTAC 


TAGTAACATT 


ACTACAGTTA 


TTATAATTTT 


CTATTCTCTT 


TTTCTTTAAG 


2760 


236 


AATCTATCAT 


TAACGTTAAT 


TTCTATATAT 


ACATAACTAC 


CAT TAT AC AC 


GCTATTATCG 


2820 


237 


TTTACATATC 


ACATCACCGT 


TAATGAAAGA 


TACGACACCC 


TGTACACTAA 


CACAATTAAA 


2880 


238 


TAATCGCCAT 


AACCTTTTCT 


GTTATCTATA 


GCCCTTAAAG 


CTGTTTCTTC 


GAGCTTTTCA 


2940 


239 


CTGCAG 












2946 


241 (2) 


INFORMATION 


FOR SEQ ID 


NO : 7 : . 











242 (i) SEQUENCE CHARACTERISTICS: 

243 (A) LENGTH: 3178 base pairs 

244 (B) TYPE: nucleic acid 



file://C:\CRF4\Outhold\VsrI308207.htm 



1/9/03 



Page 6 of 7 



VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/308,207 



DATE: 01/09/2003 
TIME: 11:14:34 



Input Set : N:\vernette\US09308207.raw 
Output Set: N:\CRF4\01092003\l308207.raw 



:20 M: 

:21 M: 

; 1773 

:1858 

; 1885 

;1912 

: 1937 

:1962 

:1993 

:2063 

;2181 



220 C: 

220 C; 

M:246 

M:246 

M:246 

M:246 

M:246 

M:246 

M:246 

M:246 

M:336 



Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 



[(A) APPLICATION NUMBER:] 
[ (B) FILING DATE: ] 



Invalid value of Alpha Sequence Header Field, 

Invalid value of Alpha Sequence Header Field, 

Invalid value of Alpha Sequence Header Field, 

Invalid value of Alpha Sequence Header Field, 

Invalid value of Alpha Sequence Header Field, 

Invalid value of Alpha Sequence Header Field, 

Invalid value of Alpha Sequence Header Field, 

Invalid value of Alpha Sequence Header Field, 



[MOLECULE TYPE 
[MOLECULE TYPE 
[MOLECULE TYPE 
[MOLECULE TYPE 
[MOLECULE TYPE 
[MOLECULE TYPE 
[MOLECULE TYPE 
[MOLECULE TYPE 



SeqNo=59 
SeqNo=60 
SeqNo=61 
SeqNo=62 
SeqNo=63 
SeqNo=64 
SeqNo=65 
SeqNo=67 



Invalid Amino Acid Number in Coding Region, SEQ ID: 68 
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